Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation
Abstract
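Neural dialogue models suffer from low-quality responses when interacted with in practice, demonstrating difficulty in generalization beyond training data. Recently, knowledge distillation has been used to successfully regularize the student by transferring knowledge from the teacher. However, the teacher and the student are trained on the same dataset and tend to learn similar feature representations, whereas the most general knowledge should be found through differences. The finding of general knowledge is further hindered by unidirectional distillation, as the student should obey the teacher and may discard some knowledge that is truly general but refuted by the teacher. To this end, we propose a novel training framework, where the learning of general knowledge is more in line with the idea of reaching consensus, i.e., finding common knowledge that is beneficial to different yet all datasets through diversified learning partners. Concretely, the training task is divided into a group of subtasks with the same number of students. Each student assigned to one subtask is not only optimized on the allocated subtask but also imitates the multi-view feature representation aggregated from other students (i.e., student peers), which induces students to capture common knowledge among different subtasks and alleviates the over-fitting of students on the allocated subtasks. To further enhance generalization, we extend the unidirectional distillation to bidirectional distillation, which encourages the student and its student peers to co-evolve by exchanging complementary knowledge with each other. Empirical results and analysis demonstrate that our training framework effectively improves model generalization without sacrificing training efficiency.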
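The training scheme described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the round-robin split, the mean as the aggregation of peer features, the mean-squared-error imitation term, and the names `split_into_subtasks`, `aggregate_peer_features`, `imitation_loss`, `student_objectives`, and `weight` are all assumptions made for illustration. The sketch only computes per-student objective values; it does not train a model.

```python
from typing import List, Sequence


def split_into_subtasks(examples: Sequence, num_students: int) -> List[list]:
    """Partition the training examples into one subtask per student.

    Round-robin assignment is an assumption; the abstract does not say how to split.
    """
    return [list(examples[i::num_students]) for i in range(num_students)]


def aggregate_peer_features(features: List[List[float]], student: int) -> List[float]:
    """Build the multi-view target for `student` from all other students.

    Averaging the peers' feature vectors is an assumed aggregator.
    """
    peers = [f for j, f in enumerate(features) if j != student]
    return [sum(vals) / len(peers) for vals in zip(*peers)]


def imitation_loss(own: List[float], target: List[float]) -> float:
    """Mean squared error between a student's features and the peer view (assumed loss)."""
    return sum((a - b) ** 2 for a, b in zip(own, target)) / len(own)


def student_objectives(
    task_losses: List[float], features: List[List[float]], weight: float = 0.5
) -> List[float]:
    """Per-student objective: subtask loss plus a weighted imitation term.

    Every student is both an imitator and part of its peers' targets, which is
    one simple way to picture the bidirectional exchange described in the abstract.
    """
    return [
        task_losses[i]
        + weight * imitation_loss(features[i], aggregate_peer_features(features, i))
        for i in range(len(features))
    ]


if __name__ == "__main__":
    subtasks = split_into_subtasks(list(range(6)), num_students=3)
    feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy feature vectors
    print(subtasks)
    print(student_objectives([1.0, 1.0, 1.0], feats))
```

In this toy setting the third student, whose features sit closest to the average of its peers, receives the smallest imitation penalty.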
Similar resources
Data Distillation for Controlling Specificity in Dialogue Generation
People speak at different levels of specificity in different situations. A conversational agent should have this ability and know when to be specific and when to be general. We propose an approach that gives a neural network–based conversational agent this ability. Our approach involves alternating between data distillation and model training: removing training examples that are closest to th...
Multi-view Feature Learning with Discriminative Regularization
More and more multi-view data which can capture rich information from heterogeneous features are widely used in real world applications. How to integrate different types of features, and how to learn low dimensional and discriminative information from high dimensional data are two main challenges. To address these challenges, this paper proposes a novel multi-view feature learning framework, wh...
Weighted Multi-view Clustering with Feature Selection
In recent years, combining multiple sources or views of datasets for data clustering has been a popular practice for improving clustering accuracy. As different views are different representations of the same set of instances, we can simultaneously use information from multiple views to improve the clustering results generated by the limited information from a single view. Previous studies main...
On multi-view feature learning
Sparse coding is a common approach to learning local features for object recognition. Recently, there has been an increasing interest in learning features from spatio-temporal, binocular, or other multi-observation data, where the goal is to encode the relationship between images rather than the content of a single image. We provide an analysis of multi-view feature learning, which shows that h...
Generation for Dialogue Translation Using Typed Feature Structure Unification
This article introduces a bidirectional grammar generation system called feature structure-directed generation, developed for a dialogue translation system. The system utilizes typed feature structures to control the top-down derivation in a declarative way. This generation system also uses disjunctive feature structures to reduce the number of co...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2021
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v35i14.17516